Method and apparatus that determines charged particle beam shape codes

ABSTRACT

A lithography method and apparatus which represent a substrate surface as gray level values and determine a shape data that specifies a shape and position of a flash field. The apparatus receives a pattern in a vector format, represents the substrate surface as a grid of pixels, and then represents each pixel as a gray level value specifying a proportion of the pixel that includes the pattern. Subsequently the apparatus constructs a matrix of a quadrant of four pixels and surrounding pixels, modifies the matrix so that three intermediate shapes corresponding to an exposed region of the quadrant may be provided, determines an intermediate shape data of the quadrant; and performs a reverse modification on the shape to determine the shape data that specifies a flash field.

COPYRIGHT NOTICE

A portion of the disclosure of this patent document contains material that is subject to copyright protection. The copyright owner has no objection to the facsimile reproduction by anyone of the patent disclosure, as it appears in the Patent and Trademark Office patent files or records, but otherwise reserves all copyright rights whatsoever.

COMPUTER PROGRAM LISTING APPENDIX

An Appendix filed herewith on two identical compact discs (Copy 1 and Copy 2) each containing one computer program listing constitutes a part of the specification pursuant to 37 CFR §§1.77 and 1.96. Each compact disc contains one computer program listing named “ComputerProgramListing” created on the compact discs on Jun. 3, 2002 of file size 15,956 bytes (16,384 bytes on disk) on each compact disc. The entirety of the computer program listing contained in the compact disc is incorporated by reference herein.

BACKGROUND

1. Field of the Invention

The present invention relates to lithography and to electron (or other energy) beam columns and more specifically to a structure and method for determining shape codes of variable shaped beams.

2. Description of the Related Art

It is well known in the field of lithography (pattern generation) that it is desirable to increase the throughput of pattern generation systems. Two main applications for such pattern generation systems are making masks for use in semiconductor fabrication by electron beam lithography and electron beam direct writing of patterns onto wafers to form semiconductor devices.

Lithography systems generate or expose patterns by controlling the flow of energy (the beam) from a source to a substrate coated with a layer sensitive to that form of energy. Pattern exposure is controlled and broken into discrete units commonly referred to as flashes, wherein a flash is that portion of the pattern exposed during one cycle of an exposure sequence. Flashes are produced by allowing energy from the source, for example light, electron or other particle beams, to reach the coated substrate within selected pattern areas. The details of flash composition, dose and exposure sequence used to produce a pattern, and hence the control of the lithographic system, make up what is known as a writing strategy.

A traditional raster scan writing strategy employs a uniform periodic raster scan, somewhat similar to television raster scanning. A mechanical stage moves a substrate, for example placed on a table, uniformly in a direction orthogonal to the direction of the uniform scan of an energy beam. In this manner a pattern is composed on a regular grid with a regular scan trajectory resulting from the orthogonal movement of the stage and beam. When the beam is positioned over a grid site requiring exposure, the beam is unblanked and the underlying site exposed. Only the amount of dose, or energy, at each site is varied as required. Hence, exposure data can be organized in a time sequence corresponding to the regular scan trajectory, and only the dose for each site need be specified. The distinguishing characteristics of a traditional raster scan writing strategy are a small round beam exposing one site at a time, a periodic scan moving sequentially to each site of a grid and a rasterized representation of data corresponding to the required dose for each site or “pixel” of the grid.

On the other hand, in a typical vector scan writing strategy, the beam is positioned only over those sites that require exposure and then unblanked to expose the site. Positioning is accomplished by a combination of stage and beam movement in what is often referred to as a semi-random scan. Thus, data must be provided that includes both the dose and position of each flash or site exposed. Frequently vector scan strategies use a variable shaped beam, that is a beam capable of having a different size and/or shape for each flash. The pattern is then composed from these variable shapes. A shaped beam is capable of exposing multiple pixel sites simultaneously instead of one pixel site at a time as in a raster scan writing strategy. Where a variable shaped beam is used, the data must additionally include the location, size and shape for each flash. Thus the distinguishing characteristics of traditional vector scan writing strategies are a variable shaped and sized beam exposing multiple pixel sites in a single flash, a semi-random scan encompassing only those portions of a pattern to be exposed, and a vectorized representation of data including the location, size, shape and dose of each flash.

Both vector and raster scan writing strategies have advantages and disadvantages. Vector scan strategies can offer fine pattern definition. However, vector scan flash rates are typically slower than raster scan strategies due to settling time required between the relatively large beam deflections of the semi-random scan trajectory. For patterns with exposed portions that are finely detailed, vector scan strategies are relatively slower due to delays in settling of the electron beam shaping components which are capable of shaping the beam over a wide range of dimensions. Also, current density (current per unit area) is generally lower in vector scan strategies due to the need for the electron source to be capable of covering larger areas simultaneously, again leading to lower throughput. A drawback of raster scan writing processes is a relatively coarse pattern definition.

Thus it is desirable to develop an improved writing strategy that combines the advantages of a vector scan strategy, namely, fine pattern definition, with those of a raster scan strategy, namely, increased speed, to increase the throughput of pattern generation systems.

SUMMARY

An embodiment of the present invention provides a flash converter that determines shape data that specifies a flash field among pixels, the pixels being represented as gray level values that represent a proportion of each pixel that overlaps with a pattern, the flash converter including: a reformatter which constructs a matrix of a quadrant and surrounding pixels of the flash field, where the reformatter modifies the matrix so that N intermediate shapes correspond to an exposed region of the quadrant are provided; and a shape code determinator coupled to receive the modified matrix and which determines an intermediate shape data that specifies the quadrant, where the shape code determinator performs a reverse modification on the intermediate shape data and outputs shape data that specifies the flash field. In one embodiment, N is 3.

An embodiment of the present invention provides a method for determining shape data that specifies a flash field among pixels, the pixels being represented as gray level values that represent a proportion of the pixel that overlaps with a pattern, including the acts of: constructing a matrix of a quadrant and surrounding pixels; modifying the matrix so that N intermediate shapes corresponding to an exposed region of the quadrant are provided; determining an intermediate shape data that specifies the quadrant; and performing a reverse modification on the intermediate shape data to determine the shape data that specifies the flash field. In one embodiment; N is 3.

The present invention will be more fully understood in light of the following detailed description taken together with the accompanying drawings.

BRIEF DESCRIPTION OF THE DRAWINGS

FIG. 1 depicts a block diagram of a system 100 in accordance with an embodiment of the present invention.

FIG. 2A depicts a block diagram of a suitable rasterizer 102 in accordance with an embodiment of the present invention.

FIG. 2B depicts a flow diagram of a suitable process 200 executed by rasterizer 102 in accordance with an embodiment of the present invention.

FIG. 3 depicts an example portion of a grid 302 that divides a portion of the surface of the substrate, including pattern 306, into pixels 310 in accordance with an embodiment of the present invention.

FIG. 4 depicts an example of a corner of a pattern within a pixel.

FIG. 5A depicts a suitable flash converter 108 in block diagram form in accordance with an embodiment of the present invention.

FIG. 5B depicts a suitable process implemented by flash converter 108 in accordance with an embodiment of the present invention.

FIG. 6 depicts 14 basic shapes, each within a quadrant 304, and associated shape codes in accordance with an embodiment of the present invention.

FIG. 7 shows an enlarged view of portion 308 of pattern 306 within a quadrant.

FIG. 8 depicts a flow diagram of the process of 502 of FIG. 5B in more detail in accordance with an embodiment of the present invention.

FIG. 9 depicts matrix A and matrix B in more detail.

FIG. 10 depicts a process to determine an intermediate shape code in accordance with an embodiment of the present invention.

FIG. 11 depicts an example of unexposed portions of a matrix B for states B-D in accordance with an embodiment of the present invention.

FIG. 12 depicts a schematic diagram of electron beam column 118 in accordance with an embodiment of the present invention.

FIG. 13A depicts a plan view of a portion of upper aperture 1210 in more detail in accordance with an embodiment of the present invention.

FIG. 13B depicts a cross sectional view of upper aperture 1210 of FIG. 13A along line A—A in accordance. with an embodiment of the present invention.

FIG. 14A depicts a plan view of a portion of lower aperture 1214A in accordance with an embodiment of the present invention.

FIG. 14B depicts a cross sectional view of lower aperture 1214A of FIG. 14A along line B—B in accordance with an embodiment of the present invention.

FIG. 15A depicts a plan view of a portion of lower aperture 1214B in accordance with an embodiment of the present invention.

FIG. 15B depicts a cross sectional view of lower aperture 1214B of FIG. 15A along line C—C in accordance with an embodiment of the present invention.

FIG. 16 depicts a suitable implementation and arrangement of conventional upper deflector 1212 and conventional lower deflector 1216 in accordance with an embodiment of the present invention.

FIG. 17A depicts a block diagram of shaper/blanker driver 110 in accordance with an embodiment of the present invention.

FIG. 17B depicts a more detailed block diagram of shaper/blanker driver 110 in accordance with an embodiment of the present invention.

FIG. 18 depicts schematically an example path of electron beam 1222 through upper aperture 1210 and lower aperture 1214A in accordance with an embodiment of the present invention.

FIG. 19A depicts a top plan view of lower aperture 1214A with blanking position 1904 and shape 1908 generated using openings 1402A—1402D.

FIG. 19B depicts a top plan view of lower aperture 1214A with blanking position 1904, and shape 1910 generated using openings 1402A—1402D.

FIG. 20A depicts an example of a shaping of the cross section of electron beam 1222 as shape 1908 of FIG. 19A using opening 1502 of lower aperture 1214B.

FIG. 20B depicts an example of a shaping of the cross section of electron beam 1222 as shape 1910 of FIG. 19B using opening 1502 of lower aperture 1214B.

FIGS. 21-1 through 21-5 are associated with and described in Appendix A.

Note that use of the same reference numbers in different figures indicates the same or like elements.

DETAILED DESCRIPTION System Overview

This disclosure is directed to a system and process for generating and writing electron (or other energy) beams with specified cross sectional shapes directed onto a substrate in a conventional “regular periodic trajectory” raster scan. One embodiment generates electron beams whose largest cross sectional shapes are smaller than that of an electron beam cross section generated by a conventional vector shaped beam apparatus. Thus this embodiment allows for smaller pattern definition than with a conventional vector shaped beam apparatus.

FIG. 1 depicts a block diagram of a lithography (imaging) system 100 in accordance with this embodiment including a rasterizer circuit 102, a buffer circuit 104, a dose value circuitry 106, a flash converter 108, shaper/blanker driver 110, and electron beam column 112 . Flash converter 108 and shaper/blanker driver 110 are each coupled to receive a clock (timing) signal from clock 114. In this example, the clock signal frequency of clock 114 is 800 MHz. For detail of rasterizer circuit 102, buffer circuit 104, dose value circuitry 106, flash converter 108, shaper/blanker driver 110, and electron beam column 112, see below. All dimensions and parameters herein are exemplary.

In this embodiment, rasterizer circuit 102 first receives (e.g. from a conventional lithography data structure) a pattern that is to be written onto a substrate 118 specified by its shape location on the substrate (so called “vector format”). Rasterizer 102 then divides the surface of the substrate 118 into a grid of pixels and represents each pixel as a “gray level value” which specifies a fraction of the pixel's area which includes part of the pattern. Rasterizer 102 outputs each gray level value to both buffer 104 and dose value circuitry 106. (The connecting lines in FIG. 1 inside the dotted lines typically represent multi-line data busses.) Buffer 104 provides gray level values to flash converter 108. In this embodiment, flash converter 108 represents each square arrangement (2 dimensional) of four pixels (“quadrant”) as a flash field that may be exposed in one flash cycle (hereafter the term “flash field” represents a blank or a shape that electron beam column 112 writes onto substrate 118). In other embodiments, flash converter 108 may represent a flash field as an N by M pixel rectangle, a larger or smaller sized square arrangement, or other shape. Flash converter 108 specifies each flash field by shape class and coordinates (shape_x, shape_y) (hereafter the term “shape data” refers to both shape class and coordinates). Dose value circuitry 106 receives gray level values associated with each quadrant from the rasterizer circuit 102 and outputs dose values associated with each flash field.

Shaper/blanker driver 110 requests shape data and corresponding dose values (hereafter shape data and corresponding dose values are referred together as a “flash data”) from respective flash converter 108 and dose value circuitry 106. In one embodiment, flash converter 108 and dose value circuitry 106 provide a flash data to shaper/blanker driver 110 approximately every 10 ns. Shaper/blanker driver 110 converts each flash data to voltage values and provides the voltages to control electron beam column 112 to write the specified flash field in a proper location on the substrate 118. In this embodiment, electron beam column 112 writes a new flash field every 10 ns (hereafter “flash cycle”). For a blank flash field, electron beam column 112 does not write the electron beam onto substrate 118. An ion beam column or other energy beam (e.g., laser) may be substituted for column 112.

Electron beam column 112 writes flash fields in, e.g., a conventional “regular periodic trajectory” raster scan. In this embodiment, a conventional “regular periodic trajectory” raster scan can be a “unidirectional” type or a “bidirectional” type. “Regular periodic trajectory” means that the scan moves uniformly and periodically, and the motion is not controlled by pattern data. In a “unidirectional” type, a scan of a grid begins with, e.g., a lower left corner of the grid defined on a portion of the substrate and proceeds to the top left corner of the grid, then returns to the bottom of the next leftmost column with the beam blanked, and continues to scan the next leftmost column in the same direction as the first, i.e. from bottom to top. Scanning continues in the same manner until the entire grid to be patterned is covered. In a conventional “bidirectional” type raster scan, a scan of a grid begins with, e.g., a lower left corner of the grid defined on a portion of the substrate and proceeds to the top left corner of the grid, and continues to scan the next leftmost column in the opposite direction to the first, i.e. from top to bottom. Scanning continues in such up-then-down order until the entire grid to be patterned is scanned.

Prior art vector scan systems require large data buffers that store vector scanned patterns. Pattern sizes are highly variable and so a data buffer adequate to store pattern data in vector format has a very large storage capacity. However, the large data capacity of the buffer increases cost. The present embodiment employs real-time processing of flash field data to avoid the large data buffer of the prior art. In this embodiment, rasterizer 102, flash converter 108, dose value circuitry 106, and shaper/blanker driver 110 together process the shape and duration of a flash field immediately prior to flash field generation by electron beam column 112.

Rasterizer Circuit 102

In accordance with an embodiment of the present invention, FIG. 2A schematically depicts a suitable rasterizer circuit 102. Rasterizer circuit 102 includes a pixelizer 210 and gray level specifier 212. In one embodiment, rasterizer 102 is “hardwired” logic circuitry that performs a process 200 described in more detail below with respect to FIG. 2B. In other embodiments, rasterizer 102 may be a computer that implements a software form of process 200. All parameters herein are exemplary.

Rasterizer 102 receives as an input signal data in vector format representing a conventional pattern to be written on a substrate. The pattern conventionally defines, e.g., a layer of an integrated circuit, and is expressed in x-y coordinates. Rasterizer 102 decomposes a pattern image to be written onto a portion of the substrate into a grid of pixels and represents each pixel as a gray level value which specifies a fraction of a pixel which includes a pattern.

FIG. 2B depicts a flow diagram of a suitable process 200 executed by rasterizer 102 to represent the pattern image to be written on a surface of the substrate as gray level values.

In 201, pixelizer 210 divides a portion of the substrate surface into a grid. Pixelizer 210 further positions any pattern, where specified, within the grid. In this embodiment, each grid is at most, e.g., 8,192 pixels by 1,440,000 pixels. For imaging a mask requiring a minimum feature size of 200 nm, each pixel is square shaped and approximately 100 nm on a side, although other pixels shapes can be used. FIG. 3 depicts an example portion of a grid 302 that divides a portion of the surface of the substrate, on which is imaged pattern 306, divided into pixels 310.

In 202 of FIG. 2, pixelizer 210 passes the grid to gray level specifier 212, which represents each pixel 310 by a gray level value. The gray level value represents a proportion of subpixels within a pixel 310 that overlap with pattern 306. In this embodiment of the present invention, gray level values range from 0 to 16. For example, a pixel including no pattern 306 has a gray level value of 0. FIG. 4 depicts an example of a corner of a pattern within a pixel. In the example, 64 (those pixels at the edge of the dark area) out of 256 sub-pixels are overlapped thus representing a gray level value of 4. In this embodiment, gray level specifier 212 specifies each gray level value by a 5 bit value thus allowing up to 32 gray levels.

In 203, rasterizer 102 stores the gray level values associated with the grid in buffer 104.

Subsequently, rasterizer 102 repeats process 200 of FIG. 2B until approximately all of the image to be written on the substrate is represented by pixels.

In conventional vector scan apparatuses, only patterns that are to be written on a substrate are coded. In this embodiment, the rasterizer divides a portion of the surface of the substrate into a grid and represents every pixel on the surface. The representation of a full substrate image provides many advantages over conventional vector format representations of patterns, particularly because each pixel, regardless of whether it includes a pattern, is represented.

For example, conventional proximity error corrections can be calculated more readily because each pixel, whether or not exposed, is represented. Proximity error correction involves adjusting a level of electron beam exposure to a particular area of a substrate to avoid overexposure by considering exposure to pixels adjacent to the area. If the patterns are in vector format, determining the proximity of patterns requires numerous calculations.

Overlap determination can be calculated more readily in this embodiment. Overlap determination is required to prevent overexposure of a region where multiple patterns overlap. With patterns in vector format, each pattern is coded separately, even the overlapping patterns. Thus to determine overlap requires numerous calculations. Since every pixel is represented by gray level values in this embodiment, it is much easier to determine overlap.

Sometimes patterns must be “tone reversed”, i.e. normally unexposed portions of a substrate are exposed and normally exposed pattern portions are not to be exposed. In this embodiment, pixels can easily be tone reversed because even the normally unexposed pixels are represented. For patterns in vector format, only exposed areas are coded, so it is difficult to tone reverse the unexposed areas.

With vector format patterns, the number of flash fields in a pattern may be enormous and therefore requires an impractically large buffer space. In this embodiment each pixel is represented individually, so the pixels can be loaded in a buffer space in discrete steps, even partitioning a pattern. In this embodiment, the rasterizer 102 outputs gray level values to buffer device at a constant rate, thereby allowing buffer 104 to include less storage space than in the prior art.

Flash Converter 108

Flash converter 108 converts gray level values of pixels into shape data that specify the shape of a flash field. FIG. 5A depicts a suitable flash converter 108 in block diagram form. As shown, flash converter 108 includes reformatter logic 510, shape code determination logic 512, first lookup table (LUT) 514, and second LUT 516. A suitable implementation of first and second LUTs 514 and 516 is static random access memory (SRAM). Flash converter 108 is coupled to receive gray level values from buffer 104 and a clock signal from clock 114. Flash converter 108 outputs shape data to shaper/blanker driver 110 in accordance with clock signal from clock 114.

In this embodiment, flash converter 108 is hardwired logic that performs process 500 discussed below with respect to FIG. 5B. In other embodiments, flash converter 108 may be a computer that implements a software form of process 500. Appendix B is a Pascal computer language simulation of process 500 as carried out by the hardwired logic implementation of flash converter 108. All parameters herein are exemplary.

In 501, buffer 104 provides a signal having, e.g., at least 16 gray level values to reformatter logic 510. In this embodiment the gray level values correspond to a square formation of 4 by 4 pixels (hereafter “matrix A”) with a quadrant of interest in the center. FIG. 9 depicts matrix A in more detail. The quadrant of interest corresponds to pixels a22, a23, a32, and a33 and the remaining pixels are “surrounding pixels”. For quadrants on the edges of a grid, pixels surrounding the quadrants that are not within the grid have a gray level value of 0. Flash converter 108 represents the quadrant of interest as shape data.

In a first execution of 501, buffer 104 first outputs the data pertaining to the bottom left hand quadrant. In subsequent executions of 501, buffer 104 outputs data pertaining to quadrants in the raster scan described earlier.

In 502, reformatter logic 510 represents the quadrant by shape data, i.e., a shape code and coordinates (shape_x, shape_y). The shape code represents a basic shape that ranges from a quadrant that is to be fully exposed by the beam to a non-exposed quadrant. The coordinates modify basic shapes by subpixels so that the modified shape better approximates the portion of a pattern within the quadrant. The shape of each flash field is specified by a shape code and coordinates.

FIG. 6 depicts 14 basic shapes, each within a quadrant 304, and associated assigned shape codes in accordance with this embodiment. The dark portion of each shape represents an area that is to be exposed by the (electron) beam (“exposed area”). The largest exposed area is a full quadrant and corresponds to shape code 16. Shape codes 1 to 4 specify rectangular shaped exposed areas with four different rotations. Shape codes 5 to 8 specify exposed areas that can be either square or rectangular shaped with four different rotations. Shape codes 9 to 12 represent L-shaped exposed areas with four different rotations. In other embodiments, shape codes can represent other shapes.

The coordinates modify shapes corresponding to shape codes 1 to 12 by sub-pixels. In this embodiment, shape_x and shape_y each take on values 0 to 31. For example, FIG. 7 shows an enlarged view of portion 308 of pattern 306 within a quadrant. Portion 308 corresponds to a shape code of 12. In the example, alteration of the shape with shape code 12 is accomplished by specifying coordinates shape_x and shape_y of (19,20). In the example, the coordinates specify the corner position of the shape.

FIG. 8 depicts a flow diagram of the process of 502 in more detail. In 801, reformatter logic 510 loads matrix A.

In 802, reformatter logic 510 modifies matrix A so that pixel a22 has the highest gray level value among the quadrant of interest by any or all of three operations: 1) rotation, 2) flipping, or 3) reverse toning. Variable “rotate” takes on values of 0, 1, 2, or 3, and represents whether matrix A has been rotated respectively by either 0, 90, 180, or 270 degrees. Variable “flip” specifies whether the matrix A has been “flipped”, i.e., for each pixel, exchanging coordinates shape_x and shape_y but retaining the gray level value. Variable “reverse” specifies whether to represent each gray level value as 16 minus its gray level value. Reformatter logic 510 stores the variables and records the order of operations for later use. The resulting matrix is matrix B, shown in FIG. 9. Reformatter logic 510 then outputs the matrix B and the variables and the order of operations to shape code determination logic 512.

In 803, shape code determination logic 512 applies a process shown in FIG. 10 to the center four pixels of matrix B to determine an intermediate shape code. First, in 1002, shape code determination logic 512 determines if the pixel b23 has a gray level value of 16. If not, in 1003, the intermediate shape code is either 5, 1, or 11 (state D). If so, then in 1004, shape code determination logic 512 determines if pixel b32 has a gray value of 16. If not, in 1005, the intermediate shape code is either 1 or 11 (state C). If so, then in 1006, shape code determination logic 512 determines if pixel b33 has a gray value of 16. If not then in 1007, the intermediate shape code is 11 (state B). If so, then in 1008, the intermediate shape code is 16 (state A).

Thus if the state is C or D, shape code determination logic 512 subsequently determines intermediate coordinates and an intermediate shape code. For state B, shape code determination logic 512 subsequently determines only intermediate coordinates by which to modify the shape corresponding to the intermediate shape code of 11. For state A, no coordinates need be determined.

In this embodiment, for state B, to determine intermediate coordinates, shape code determination logic 512 provides gray level values of pixels shown in Table 1 to first lookup table 514 which in turn outputs the corresponding intermediate coordinates. For states C-D, to determine intermediate coordinates and an intermediate shape code, shape code determination logic 512 outputs gray level values of pixels shown in Table 1 to first lookup table 514 which in turn outputs the intermediate coordinates and intermediate shape code.

TABLE 1 State pixels considered B b33, b34, b43, and b44 C b31, b32, b33, b34, and b42 D b22, b23, b32, b33, and b42

For state B, the intermediate coordinate entries in first lookup table 514 are derived as follows. Pixels b22, b23, and b32 have gray level values of 16. The gray level value of pixel b33 is specified but the exposed sub-pixels are not specified. Thus a single gray level value could correspond to a variety of exposed sub-pixels. In this embodiment it is assumed that a corner is formed through pixels b34, b33, and b43 (broken lines in matrix 1102 of FIG. 11), with the intersection of the sides of the corner at edge point 1118 in pixel b33. The interior shaded region of corner 1108 is not exposed. The angle between the sides intersecting edge point 1118 is not necessarily 90 degrees. In this embodiment, the coordinates of the edge point 1118 and hence the exposed portion of pixel b33 is estimated by determining a corner 1108 through pixels b34, b33, and b43 that corresponds to a minimum gray level error. Specifically, each coordinate in first lookup table 514 corresponds to a minimum error between 1) the specified gray level values of the flash field and pixels b34, b43, and b44 and 2) the gray level values of the resulting flash field and the pixels b34, b43, and b44 with the edge point set at the coordinate.

In this embodiment, the formula given below is calculated for each coordinate from (16,16) to (31,31) for all possible gray level values. Each intermediate coordinate in first lookup table 514 corresponds to the minimum error value from the following formula:

Error=K(F)+L(PE)+M(T)  (1)

Variable F represents the absolute value of the difference between the specified total gray level value of the flash field and the total gray level value of the flash field achieved by a coordinate.

Variable PE represents a maximum error between each gray level value of each pixel in Table 1 for state B and the gray level values of the pixels in Table 1 for state B achieved by a coordinate.

Variable T represents the sum of variable F and the sum of absolute differences between each specified gray level value of each pixel in Table 1 for state B and the gray level values of the pixels in Table 1 for state B achieved by a coordinate.

In one embodiment, weighting variables K, L, and M are respectively 8, 4, and 1. This weighs most heavily the variable F.

For state C, the intermediate coordinate entries in first lookup table 514 are derived as follows. Pixels b22 and b23 have gray level values of 16. The gray level values of pixels b32 and b33 are specified but the exposed sub-pixels are not. Since the exposed portion of the quadrant can correspond to shape code 1 or 11, either a corner is formed through pixels b32, b33, b34, and b42 or a straight edge is formed through pixels b32, b33, b34, and b31 (broken lines in matrix 1104 of FIG. 11). The angle between sides of the corner can be other than 90 degrees. For the corner that corresponds to a shape code of 11, intermediate coordinates specify the intersection of sides at edge point 1120 in pixel b32. The unexposed portion of the corner is shown as region 1110 and the unexposed portion of the straight edge is shown as regions 1110 and 1112 together.

For state C, the intermediate shape codes and intermediate coordinates for each combination of gray level values for pixels b31, b32, b33, b34, and b42 specified in the lookup table correspond to minimum error values from the formula. The formula discussed earlier is calculated for each of shape codes 1 and 11 and coordinates ranging from (0,16) to (31,31). In the formula, variable PE represents a maximum error between each gray level value of each pixel in Table 1 for state C and the gray level values of the pixels in Table 1 for state C achieved by a coordinate. Variable T represents the sum of variable F and the sum of absolute differences between each specified gray level value of each pixel in Table 1 for state C and the gray level values of the pixels in Table 1 for state C achieved by a coordinate.

For state D, the intermediate coordinate entries in first lookup table 514 are derived as follows. The gray level values of pixels b22, b23, b32, and b33 are specified but the exposed sub-pixels are not. As the exposed portion of the quadrant can correspond to shape code 1, 5, or 11, either a corner is formed through pixels b42, b32, b22, b23, and b24 or a straight edge is formed through pixels b21, b22, b23, and b24 (broken lines in matrix 1106 of FIG. 11). The unexposed portion of the corner is shown as region 1116 and the unexposed portion of the straight edge is shown as regions 1114 and 1116 together. For the corner that corresponds to a shape code of 11 or 5, coordinates specify the intersection of sides at edge point 1122 through any pixel of the quadrant. The angle between sides of edge point 1122 can be other than 90 degrees. For example, a shape code 5 could correspond to the angle between sides of corner of more than 90 degrees.

For state D, the intermediate shape codes and intermediate coordinates for each combination of gray values for pixels b22, b23, b32, b33, and b42 specified in first lookup table 514 correspond to minimum error values from the formula. The formula discussed earlier is calculated for each of shape codes 1, 5, and 11 and coordinates ranging from (0,0) to (31,31). In the formula, variable PE represents a maximum error between each gray level value of each pixel in Table 1 for state D and the gray level values of the pixels in Table 1 for state D achieved by a coordinate. Variable T represents the sum of variable F and the sum of absolute differences between each specified gray level value of each pixel in Table 1 for state D and the gray level values of the pixels in Table 1 for state D achieved by a coordinate.

In 804 of FIG. 8, for all states, shape code determination logic 512 reverses any modification performed by reformatter logic 510 in 802 in a reverse order on the shape specified by shape code and coordinates determined in 803. In this embodiment, shape code determination logic 512 accesses second lookup table 516 that includes coordinates and shape codes for every combination of reverse-transformation operations on every possible intermediate shape code and intermediate coordinates, i.e., shape codes 1, 5, or 11 and coordinates (0,0) to (31,31). In 805, shape code determination logic 512 reads the appropriate shape code and coordinates from second lookup table 516.

In 503 of FIG. 5B, shape code determination logic 512 provides shape data to shaper/blanker driver 110. In this embodiment, flash converter 108 provides shape data to shaper/blanker driver 110 approximately every 10 ns.

Flash converter 108 repeats steps 501 to 503 for each quadrant in the grid until all quadrants in the grid specified in 202 of FIG. 2B are represented by shape data.

Advantageously this embodiment allows for a reduced number of look up table entries and circuitry necessary in load operations. First lookup table 514 includes coordinate entries for three shape codes, i.e. 1, 5, and 11. In this embodiment, 17⁴ values are needed in a look up table for state B and 17⁵ values are needed in a look up table for each of states C and D. Otherwise, lookup tables would be required for each of shape codes 1-12. Thus this embodiment reduces the number of costly lookup tables.

In this embodiment, each lookup table value requires 2 bytes, which includes a 5 bit shape_x coordinate and 5 bit shape_y coordinate value and a 5 bit shape code. In this embodiment each of first lookup table 514 and second lookup table 516 require approximately 6 megabytes.

Dose Value Circuitry 106

In one embodiment, dose value circuitry 106 receives gray level values among and around a flash field from rasterizer 102, selects three dose values, “dose1”, “dose2”, and “dose3”, from a programmable lookup table, and outputs the dose values to shaper/blanker driver 110. Dose value entries associated with a shape data depend on arrangements and magnitudes of gray level values among the flash field represented by the shape data. In other embodiments, more or fewer dose values are associated with a flash field. Variable “dose1” specifies a level of conventional long range correction. Variable “dose2” specifies a level of conventional short range correction. Variable “dose3” specifies a level of gray level splicing correction. A suitable technique for generating dose values associated with each flash field is described in U.S. patent application Ser. No. 08/789,246, filed Jan. 28, 1997, now U.S. Pat. No. 5,847,959, commonly assigned, and Appendix A hereto, “Run-Time Correction of Proximity Effects in Raster Scan Pattern Generator Systems,” L. Veneklasen, U. Hofmann, L. Johnson, V. Boegli, and R. Innes, presented at Micro- and Nano-Engineering 98, Leuven, Belgium, Sep. 22-24, 1998, both incorporated herein by reference in their entireties.

A suitable dose value circuitry 106 includes hardwired logic and a conventional memory such as static random access memory. In other embodiments, dose value circuitry 106 may be a computer that executes suitable software. All parameters herein are exemplary.

In this embodiment, dose value circuitry 106 provides dose values associated with each shape data to shaper/blanker driver 110 every 10 ns. Shaper/blanker driver 110, described in more detail below, translates dose values associated with each shape data into exposure times that specify the duration of time of a flash field, i.e. the time an area of the substrate is exposed to the beam.

Electron Beam Column 112

FIG. 12 depicts schematically a suitable novel electron beam column 112 that generates flash fields specified by shape data in a raster scan. In this embodiment, electron beam column 112 generates flash fields by a “shadow projection” technique discussed in more detail below. Electron beam column 112 includes a conventional thermal field emission (TFE) electron source 1204, a conventional electron beam transfer lens 1206, upper aperture 1210, a conventional upper deflector 1212, lower aperture 1214, a conventional lower deflector 1216, conventional magnetic deflection coils 1218, and a conventional electron beam objective lens 1220. Electron beam column 112 writes flash fields on substrate 118.

All dimensions and parameters herein are exemplary. In other embodiments, electron beam column 112 could generate charged particle beams or other energy beams.

FIG. 12 illustrates an apparatus used for writing patterns with a minimum feature dimension of 200 nm on a mask. The apparatus of course can be altered for different minimum feature sizes. The maximum cross sectional beam size generated by electron beam column 112 corresponds to the minimum feature size of the resulting pattern.

Conventional thermal field emission (TFE) electron source 1204 outputs electron beam 1222. TFE electron source 1204 provides a current per unit solid angle, dI/dΩ, otherwise known as angular intensity, of at least 1.0 mA/steradian. TFE electron source 1204 outputs electron beam 1222 from approximately 420 mm above the surface of substrate 118.

Conventional transfer lens 1206 is positioned downstream with regard to the electron beam direction from TFE electron source 1204 (hereafter “downstream” means downstream with regard to the electron beam direction from TFE electron source 1204). Transfer lens 1206 is approximately 320 mm upstream from the surface of substrate 118. Conventional transfer lens 1206 focuses electron beam 1222 at crossover point 1230, approximately 1 mm downstream from the center, point C, of lower aperture 1214 described in more detail later.

Upper aperture 1210 is positioned downstream from transfer lens 1206. Upper aperture 1210 is positioned approximately 290 mm upstream from substrate 118. Upper aperture 1210 defines a square opening 1302 of approximately 135 μm by 135 μm. When upper aperture 1210 is illuminated by TFE electron source 1204, a well resolved shadow of square opening 1302 that corresponds to the cross section of the electron beam 1222 is projected downstream from upper aperture 1210. As shown in FIG. 12, the size of a cross section of the shadow of electron beam 1222 decreases downstream from upper aperture 1210 to the crossover point 1230.

FIG. 13A depicts a plan view of a portion of upper aperture 1210 in more detail. In this embodiment, upper aperture 1210 defines a square opening 1302 of approximately 135 μm by 135 μm. Square opening 1302 is coaxial with electron beam 1222. FIG. 13B depicts a cross sectional view of upper aperture 1210 of FIG. 2A along line A—A. The thickness of upper aperture 1210 is approximately 10 μm.

In this embodiment, upper aperture 1210 is constructed by depositing a low stress refractory metal such as tungsten-titanium alloy on a silicon membrane and then patterning 135 μm by 135 μm square opening 1302 through both the alloy and the silicon membrane by use of a focused ion beam. In another embodiment, upper aperture 1210 is made of an approximately 10 μm thick foil of metal such as molybdenum, tungsten, or an alloy such as molybdenum-rhenium which may be heated by an electrical current to reduce contamination problems.

Referring to FIG. 12, conventional upper deflector 1212 is positioned downstream from upper aperture 1210. The operation and a suitable structure of upper deflector 1212 is described in more detail below.

Lower aperture 1214 is positioned downstream from upper deflector 1212. When lower aperture 1214 is illuminated by electron beam 1222, a well resolved shaped beam is further defined by the portion of the shadow of the opening defined by upper aperture 1210 that passes through the lower aperture 1214. As shown in FIG. 12, the size of a cross section of the shadow of electron beam 1222 decreases downstream from lower aperture 1214 to the crossover point 1230 and then increases downstream from crossover point 1230.

Electron beam 1222 converges to the crossover point 1230 close to lower aperture 1214. When electron beam 1222 impinges lower aperture 1214, the size of the cross section of electron beam 1222 is very small. The small cross sectional size in turn involves use of small shaper openings in lower aperture 1214. The cross sectional size of electron beam 1222 as electron beam 1222 impinges lower aperture 1214 can be adjusted by moving the crossover point 1230, which involves changing the strength of the transfer lens 1206.

In this embodiment, lower aperture 1214 is, e.g., either lower aperture 1214A (FIG. 14A) or lower aperture 1214B (FIG. 15A). FIG. 14A depicts a plan view of a portion of lower aperture 1214A. As depicted, lower aperture 1214A includes four openings 1402A-1402D. Each short side 1412 of each opening 1402A-1402D has a length A of approximately 3 μm. As depicted, the angle between each short side 1412 is 90°. The narrow distance X between each opening 302 is approximately 3 μm. FIG. 14B depicts a cross sectional view of lower aperture 1214A of FIG. 14A along line B—B. The thickness, T, of lower aperture 1214A is approximately 10 μm.

In this embodiment, lower aperture 1214A is constructed by depositing a low stress refractory metal such as a tungsten-titanium alloy on a silicon membrane and then patterning the four opening sections 1402A-1402D through both the metal and silicon membranes by use of a focused ion beam. In another embodiment, lower aperture 1214A is made of an approximately 10 μm thick foil of metal such as molybdenum, tungsten, or an alloy such as molybdenum-rhenium which may be heated by an electrical current to reduce contamination problems.

FIG. 15A depicts a top plan view of a portion of an alternative lower aperture 1214B. As shown, lower aperture 1214B includes a cross-shaped opening 1502. Each of the 12 sides 1508 of the cross-shaped opening is approximately 3 μm. As depicted, the angle between each side 1508 is 90°. FIG. 4B depicts a cross section of lower aperture 1214B of FIG. 4A along line C—C. The thickness of lower aperture 1214B is approximately 10 μm.

In this embodiment of the present invention, lower aperture 1214B is constructed by depositing a low stress refractory metal such as tungsten-titanium alloy on a silicon membrane and then patterning cross-shaped opening 1502 through both the metal and silicon membrane. In another embodiment, lower aperture 1214B is made of an approximately 10 μm thick foil of metal such as molybdenum, tungsten, or an alloy such as molybdenum-rhenium which may be heated by an electrical current to reduce contamination problems.

Upper aperture 1210, as depicted in FIG. 13A, and lower aperture 1214A of FIG. 14A or lower aperture 1214B of FIG. 15A are coaxially aligned along an axis descending from the tip of electron source 1204 through centerpoint C shown in FIGS. 13A, 14A, and 15A.

The L-shaped or cross-shaped openings in lower aperture 1214 allow electron beam 1222 to define an edge, an exterior corner, or an interior corner, anywhere within a flash field. Thus, edges and corners in a pattern can be placed in much smaller increments as required in semiconductor device fabrication.

Conventional lower deflector 1216 is positioned downstream from lower aperture 1214. The operation and a suitable structure of lower deflector 1216 is described in more detail later.

FIG. 16 depicts a suitable implementation and arrangement of conventional upper deflector 1212 and conventional lower deflector 1216. Conventional upper deflector 1212 includes four metal plates 1602 arranged in a square shaped formation coupled to receive voltages at nodes 1606, 1608, 1604, and 1610. Similarly, conventional lower deflector 1216 includes four metal plates 1602 arranged in a square shaped formation coupled to receive voltages at nodes 1618, 1614, 1616, and 1612. In this embodiment, nodes of upper deflector 1212 and lower deflector 1216 are coupled to receive voltages from shaper/blanker driver 110. The operation of upper deflector 1212 and lower deflector 1216 are described in more detail later.

Conventional deflection coils 1218 are positioned downstream from the lower deflector 1216. Conventional deflection coils scan electron beam 1222 across the substrate 118 in a conventional raster scan. In this embodiment the length.of the scan is up to 1 mm.

In accordance with the conventional raster scan, substrate 118 is positioned on a conventional stage which moves substrate 118 in a direction perpendicular to the direction of the raster scan and within the plane of substrate 118.

Conventional objective lens 1220 is positioned besides deflection coils 1218, that is, approximately within the same plane perpendicular to the direction of electron beam. Objective lens 1220 effectively controls the size of the electron beam shadow from lower aperture 1214 written onto substrate 118. An operation of objective lens 1220 is discussed in more detail below.

In this embodiment, shadow projection involves use of a small, high brightness TFE source to obtain a high current density in the shadow, e.g. up to 3000 amperes per square centimeter in the shaped beam as well as small crossover, i.e., the beam cross section is small at crossover point 1230 compared to the size of the shape shadow in plane 1806. The use of small openings in lower aperture 1214 allows for small deflection angles by upper deflector 1212, which in turn allows for relatively low deflection voltages. Low deflection voltages allow for a high rate of shaped beam generation. The present small shape size and small required deflection voltages also allow for short settling time for each shaped flash, e.g. less than 3 ns, which further facilitates a higher throughput than possible with conventional vector shaped beam apparatuses.

The shadow projection shaping also allows the use of a relatively short beam path which reduces electron-electron interactions that would otherwise cause a blurred image of the shaped beam on substrate 118.

The TFE electron source is less appropriate to use in a conventional vector shaped beam apparatus because it can not supply sufficient beam current for the larger shapes required.

Shaper/Blanker Driver 110

Shaper/blanker driver 110 controls the shape and duration of flash fields that electron beam column 112 writes onto substrate 118 by providing voltages to upper deflector 1212 and lower deflector 1216 of electron beam column 112.

FIG. 17A depicts a block diagram of shaper/blanker driver 110 that includes translator 1720, output device 1722, timer 1708, and retrograde scan device 1710. As discussed above, shaper/blanker driver 110 requests and receives flash data, i.e. shape data and corresponding dose values from respective flash converter 108 and dose value circuitry 106. Translator 1720 receives flash data and converts the shape data and corresponding dose values into respective voltage values and an exposure time. Translator 1720 provides exposure time to timer 1708 and provides voltage values to output device 1722. Output device 1722 converts voltage values into voltage signals and provides the voltage signals to deflectors of electron beam column 112. Timer 1708 controls the duration that output device 1722 outputs voltage signals according to the exposure time. Retrograde scan device 1710 applies a retrograde scan, discussed in more detail below, to the voltage signal applied to lower deflector 1216.

FIG. 17B depicts a more detailed block diagram of shaper/blanker driver 110. In FIG. 17B, translator 1720 includes shape lookup table 1702 and dose lookup table 1704; output device 1722 includes multiplexers (MUXs) 1706A-1706D, digital to analog converters (DACs) 1712A-1, 1712A-2, 1712B-1, 1712B-2, 1712C-1, 1712C-2, 1712D-1, and 1712D-2, amplifiers 1714A-1, 1714A-2, 1714B-1, 1714B-2, 1714C-1, 1714C-2, 1714D-1, and 1714D-2, and blanking voltage register 1724.

For each input shape data, shape lookup table 1702 outputs four voltage values to MUXs 1706A-1706D. Two voltage values provided to MUXs 1706A and 1706B specify a 2 dimensional electric field deflection by upper deflector 1212 that effectively control a shaping of the electron beam cross section by controlling a location that electron beam intersects lower aperture 1214. Two voltage values provided to MUXs 1706C and 1706D specify a 2 dimensional electric field deflection by lower deflector 1216 that effectively offsets any deflection by upper deflector 1212 and positions the shaped electron beam on an intended portion of substrate 118.

In this embodiment, the location at which the electron beam intersects the lower aperture 1214 is adjustable by 4096 incremental distance units in either the horizontal or vertical direction within the plane of the lower aperture 1214. In this embodiment, each incremental unit is approximately 12/4096 μm. The fine incremental positioning allows for offsetting fine errors due for example to variations in an opening defined by lower aperture 1214 over time. In one embodiment, each voltage value is a 12 bit value.

An exemplary implementation of shape lookup table 1702 includes conventional static random access memory. In this embodiment, shape lookup table 1702 is readily programmable. This is necessary because suitable shape lookup table entries, i.e. voltage values to shape the electron beam cross section as desired, may vary. The voltage values in shape lookup table 1702 may need to be changed over time for an electron beam column because the characteristics of electron beam column 112 change over time. For example, the openings defined by the apertures may change over time due to wear. Also, the voltage values for a specific flash field may vary between various electron beam columns.

An exemplary implementation of dose lookup table 1704 includes a conventional static random access memory. Dose lookup table 1704 outputs an exposure time associated with dose values to the timer 1708. As stated above, an exposure time specifies a time that the deflectors of electron beam column 112 deflect its electron beam. In this embodiment the exposure time value is a 9 bit value and can specify at most 10 ns. In this embodiment, dose lookup table 1704 is readily programmable for similar reasons as discussed with respect to shape lookup table 1702.

Timer 1708 receives exposure time values from dose lookup table 1704 and further receives the clock signal of system clock 114 of FIG. 1. Timer 1708 outputs a binary output signal to toggle outputs of MUXs 1706A-1706D. Timer 1708 outputs a positive binary signal to MUXs 1706A-1706D for the number of clock cycles specified by each exposure time value and a negative binary signal to MUXs 1706A-1706D otherwise. A suitable implementation for timer 1708 is emitter coupled logic circuit.

In this embodiment, timer 1708 further requests flash converter 108 and dose value circuitry 106 to begin providing flash data, i.e. shape data and dose values, to shaper/blanker driver 110. In this embodiment, timer 1708 provides a first request to begin the flow of a column of flash data from buffer 1204 and repeats such request after timer receives a column of flash data. In this embodiment a column corresponds to 4096 flash data, and timer 1708 provides a request every approximately 40.96 ms.

Further operation of timer 1708 is described with respect to position adjuster 116.

MUXs 1706A-1706D are each a conventional multiplexer that receives multiple input signal and provides a single output signal in response to a control signal. A first input signal to MUXs 1706A-1706D is the set of four voltage values from the shape lookup table 1702. A second input signal is a set of four voltage values that correspond to a beam blanking position from blanking voltage register 1724. The binary output signal of timer 1708 controls the output signal of MUXs 1706A-1706D. Thus in this embodiment, during a 10 ns flash cycle, for a time specified by exposure time, MUXs 1706A-1706D output the four voltage values from shape lookup table 1702 and during the remaining time the MUXs output voltage values that blank the electron beam. In most cases the voltage values that blank the electron beam are zero although they may be adjusted to minimize dose error in a blanking operation. As depicted in FIG. 17B, MUXs 1706A-1706D provide their outputs to respective “shaper” DACs 1712A-1, 1712A-2, 1712B-1, 1712B-2, 1712C-1, 1712C-2, 1712D-1, and 1712D-2.

Conventional DACs 1712A-1, 1712A-2, 1712B-1, 1712B-2, 1712C-1, 1712C-2, 1712D-1, and 1712D-2 convert the voltage values into analog voltage signals. In this embodiment, these DACs essentially multiply each 12 bit binary voltage value by a conversion ratio of 0.5 V/2¹². In this embodiment, the maximum voltage output of a DAC is approximately 0.5 V peak-to-peak. DACs 1712A-1, 1712A-2, 1712B-1, and 1712B-2 provide analog voltages to respective conventional amplifiers 1714A-1, 1714A-2, 1714B-1, and 1714B-2, which provide voltages to upper deflector 1212. DACs 1712C-1 and 1712C-2 provide analog voltages to respective conventional amplifiers 1714C-1 and 1714C-2 which provide voltages to lower deflector 1216. DACs 1712D-1 and 1712D-2 provide analog voltages to respective conventional voltage adders 1716A and 1716B, which provide voltages, as modified by signals from retrograde scan device 1710 described in more detail below, to lower deflector 1216.

Retrograde scan device 1710 adjusts voltages provided to lower deflector 1216 to offset the movement of the position of the beam on substrate 118 during the raster scan discussed above (so called “retrograde scan”). The retrograde scan prevents the electron beam column 112 from spreading a flash field beyond its intended area. In this embodiment, retrograde scan device 1710 outputs binary values that increase or decrease in value in a stair case fashion to conventional DACs 1712E-1 and 1712E-2. In this embodiment, each of the staircase corresponds to approximately 200/8 nm of offset to the position of a flash field on the substrate. In one embodiment, retrograde scan device 1710 provides eight steps per flash cycle, i.e. 10 ns. Whether binary values increase or decrease depends on the direction of the raster scan sweep. The staircase signal is subsequently filtered (not depicted) to remove the third harmonic thus creating an approximately saw-tooth waveform with a same period as the staircase signal. Retrograde scan device 1710 adds values for a raster scan sweep up a column, i.e. 4096 flash fields arranged in a line, and subtracts values going down a column.

DACs 1712E-1 and 1712E-2 in turn output analog voltage representations of the binary values to respective voltage adders 1716A and 1716B. Voltage adders 1716A and 1716B add voltages provided by DACs 1712D-1, 1712D-2, 1712E-1, and 1712E-2 and output the sum of the voltages to respective conventional amplifiers 1714D-1 and 1714D-2.

In this embodiment, conventional amplifiers 1714A-1, 1714A-2, 1714B-1, 1714B-2, 1714C-1, 1714C-2, 1714D-1, 1714D-2, 1712E-1, and 1712E-2 output signals that are each 10 times the magnitude of the input signals. Amplifiers 1714A-1, 1714A-2, 1714B-1, and 1714B-2 output voltages to respective nodes 1606, 1608, 1604, and 1610 of upper deflector 1212. Amplifiers 1714C-1, 1714C-2, 1714D-1, and 1714D-2 output voltages to respective nodes 1618, 1614, 1612, and 1616 of lower deflector 1216.

In the prior art, retrograde scan circuitry is separate from circuitry that generates voltages to the deflectors. Advantageously, in this embodiment, by combining the retrograde scan capability into the shaper/blanker driver, the length of electron beam column 118 can be shorter than that of a prior art electron beam column. A shorter electron beam column allows use of less current to generate a flash field, which allows for faster generation of flash fields.

Position Adjuster 116

In the conventional raster scan described above, substrate 118 is positioned on a conventional stage which moves substrate 118 in a direction perpendicular to the direction of the raster scan and within a plane. A conventional position adjuster circuit 116 compensates for the horizontal movement of substrate 118 on the conventional stage. Position adjuster circuit deflects the direction of the incident electron beam by using an electric field so that electron beam column 112 writes a flash field in a proper location. The adjustment is similar to the retrograde scan described above.

Timer 1708 of shaper/blanker driver 110 communicates the approximate movement of substrate 118 to position adjuster 116. Timer 1708 provides a signal indicating when electron beam column 112 has completed writing a column of flash fields. In one embodiment, the substrate moves horizontally approximately 200 nm every 40.96 ms, i.e. a column width.

An Example Operation of Electron Beam Column 112

The following is an example of operation of electron beam column 112 during a single flash cycle. FIG. 18 depicts an example of a shaping action as electron beam 1222 traverses upper aperture 1210 and lower aperture 1214. TFE electron source 1204 emits electron beam 1222 (not depicted). Transfer lens 1206 (not depicted) focuses electron beam 1222 at crossover point 1230, approximately 1 mm downstream from lower aperture 1214. When upper aperture 1210 is illuminated by TFE electron source 1204, a well resolved shadow of square opening 1302 that corresponds to the cross section of the electron beam 1222 is projected downstream from upper aperture 1210. Initially, there is no voltage on upper deflector 1212 so electron beam 1222 intersects a solid portion of lower aperture (so called “beam blanking operation”).

Subsequently flash converter 108 and dose value circuitry 106 provide a flash data, i.e. shape data and dose values, to shaper/blanker driver 110 which applies resultant voltages to upper deflector 1212 and lower deflector 1216. Upper deflector 1212 then changes the direction of electron beam 1222 to impinge on the opening defined in lower aperture 1214 to shape the electron beam cross section as specified by the shape data. A shadow of the shaped electron beam cross section appears at site 1804 in plane 1806, downstream from lower aperture 1214. Plane 1806 is parallel to the plane of lower aperture 1214 and approximately 0.6 mm downstream from lower aperture 1214.

Lower deflector 1216 applies an electric field which changes the direction of the shaped electron beam 1222 such that the shadow at site 1804 appears to be positioned at site 1808 as viewed from substrate 118 (not depicted). Thus the lower deflector 1216 allows shaping without substantial shift of the beam's position on the substrate 118. As stated earlier, lower deflector 1216 also applies an electric field that provides a retrograde scan described above. Objective lens 1220 (not depicted) focuses on substrate 118 the shadow of the shaped electron beam 1222 at site 1808. The duration of exposure of the flash field is specified by timer 1708 of shaper/blanker driver 110. When the exposure of the flash field is complete, the beam returns to the blanked position, e.g. in the center of the lower aperture 1214A.

It is noted that in this example, the angle by which the upper deflector changes the direction of the beam's central axis 1808 (θ_(deflection)) is much smaller than the divergence angle (θ_(aperture)) of the electron beam.

The following describes an example of shaping the cross section of electron beam 1222 by lower aperture 1214A. FIGS. 19A and 19B each depict a plan view of openings 1402A-1402 D of lower aperture 1214A and a blanking position 1904 located in a solid portion of lower aperture 1214A. FIGS. 19A and 19B each further depict electron beam shapes (in cross section) 1908 and 1910 generated using respective openings 1402A and 1402C of lower aperture 1214A. Electron beam 1222 first intersects blanking position 1904 (the “beam blanking operation”). To generate shape 1908, which may correspond for example to shape class 5 and coordinates of (20, 25), upper deflector 1212 directs square shaped electron beam 1222 from blanking position 1904 to impinge on area 1902 so that the cross section of the portion of electron beam 1222 which traverses the lower aperture 1214B matches shape 1908. To generate shape 1910, which may correspond for example to shape class 10 and coordinates of (15, 25), upper deflector 1212 directs square shaped electron beam 1222 from blanking position 1904 to impinge on area 1906 so that the cross section of electron beam 1222 that traverses lower aperture 1214B matches shape 1910.

FIGS. 20A and 20B each depict an example of a shaping of the cross section of electron beam 1222 as shapes 1908 and 1910 using opening 1502 of lower aperture 1214B. To generate shape 1908, upper deflector 1212 directs square shaped electron beam 1222 to impinge area 2002 so that a portion of the cross section of electron beam 1222 which traverses lower aperture 1214B matches shape 1908. To generate shape 1910, upper deflector 1212 directs square shaped electron beam 1222 to impinge area 2004 so that a portion of the cross section of electron beam 1222 which traverses the lower aperture 1214B matches shape 1910.

As stated above, electron beam column 112 performs the beam blanking operation by directing electron beam 122 onto blanking position 1904 of lower aperture 1214A. In this embodiment, when shaper/blanker driver 110 applies approximately no voltage to nodes 1606, 1608, 1604, and 1610 of upper deflector 1212, electron beam 1222 is incident on blanking position 1904. Thus beam blanking occurs without electron beam 1222 traversing an open area of lower aperture 1214A. However, for lower aperture 1214B, electron beam 1222 traverses opening 1502 when shaper/blanker driver 110 applies no voltage to upper deflector 1212. In order to blank electron beam 122, upper deflector 1212 deflects the path of electron beam 1222 so that a solid portion of lower aperture 1214B (the specific blocking portion is not depicted) blocks the path of electron beam 1222. However, electron beam 1222 must traverse opening 1502 in lower aperture 1214B before blockage by a solid portion of lower aperture 1214B. The traversal of an opening in beam blanking introduces undesirable dose error. Thus a lower dose error is associated with lower aperture 1214A than lower aperture 1214B. Lower aperture 1214A also allows for faster beam blanking than lower aperture 1214B because there is no delay incurred from altering the path of electron beam 1222 to cause beam blanking.

It is desirable that different portions of lower aperture 1214 are used to shape electron beam 1222, in order to distribute electron beam heating of the aperture. For example exposure of a full, square shaped flash field is very common. Referring to FIG. 14A, to distribute heating of lower aperture 1214A when generating a full flash field, a square shaped cross section of electron beam 1222 is shaped using e.g. corners 1404, 1406, 1408, or 1410. Similarly, referring to FIG. 15A, when generating a full flash field, a cross section of electron beam 1222 is shaped using e.g. corners 1504, 1506, or 1512. A similar heat distribution scheme can be applied to other electron beam cross section shapes.

The above-described embodiments are illustrative and not limiting. It will thus be obvious to those skilled in the art that various changes and modifications may be made without departing from this invention in its broader aspects. For example, the distances between and dimensions of the components within electron beam column 112 such as upper aperture 1210, lower aperture 1214A, or lower aperture 1214B can be optimized for larger or smaller minimum device features. The openings defined by upper aperture 1210, lower aperture 1214A, and lower aperture 1214B can be altered. Flash fields can be other than 2 pixels by 2 pixels. The plates of lower deflector 1216 and upper deflector 1212 may be thin metal rods. Therefore, the appended claims encompass all such changes and modifications as fall within the scope of this invention.

Appendices A and B, which are part of the present disclosure, include an article and a computer program listing, which are copyrighted. The copyright owner, ETEC Systems Inc., has no objection to the facsimile reproduction by anyone of the patent document or the present disclosure, as it appears in the Patent and Trademark Office patent files or records, but otherwise reserves all copyrights whatsoever.

APPENDIX A Run-time Correction of Proximity Effects in Raster Scan Pattern Generator Systems

L. Veneklasen, U. Hofmann, L. Johnson, V. Boegli, and R. Innes Etec Systems Inc., 26460 Corporate Avenue, Hayward, Calif. 94545 USA

Abstract—In a raster scan lithography system, it is possible to correct proximity effects while the pattern is being exposed. This technique will be called run-time proximity correction. It is quite different from conventional methods that preprocess pattern data before exposure. It is a user transparent part of an exposure strategy whose throughput does not depend on the density, complexity, or randomness of the pattern. Coverage parameters are calculated, accounting for short-range aerial-image edge resolution and forward scattering effects which distort the shape of features, as well as backscattering effects which shift exposure levels with pattern density. These parameters are used to modify pixel doses according to precalculated and preloaded algorithms that equalize the exposure of all edges. This paper formulates the physical and mathematical basis for the technique, describes one of many possible correction algorithms, and explains how rasterized data is dose modulated in real time using digital signal process. ors and lookup table memory.

1. RASTER SCAN STRATEGY

A raster scan—system periodically scans the beam in one axis while the substrate is moved uniformly along the other axis. The pattern is composed on a regular grid. Each grid site, or pixel, is available for exposure during a fixed flash period, where the pixel may be exposed by a beam of approximately the same size as the pixel. Because the scan trajectory is periodic, the exposure rate depends on pixel size and flash rate, but not on the density and complexity of the pattern.

Advanced systems use a graybeam data format specifying the fraction of pattern area within each pixel. Edges may be shifted in subpixel increments using combinations of dose modulation and subpixel beam deflection, allowing a pattern to be composed on an address grid that is much finer than the pixel grid. If subpixel deflection is used to shift edges, dose modulation may be exploited to correct proximity effects.

The volume of graybeam data for the entire pattern is much too large to store in memory. Pattern data is rasterized during exposure, starting from a more compact description of features and ending with smaller segments of the graybeam map stored in buffer memory in the required exposure sequence.

It is remarkably easy to manipulate pattern data after it has been rasterized. If corrections can be generated at the pixel rate, correction will not increase the exposure time regardless of the density, complexity or randomness of the pattern. A rasterized data format makes run-time proximity correction possible.

2. THEORETICAL FORMULATION 2.1. Dose and Exposure

We define a binary pattern distribution function P(r) at location vector r within the pattern. We define an unknown dose multiplier function d(r), which will modify dose to correct proximity effects. Dose is restricted to the area of the original pattern so that the ideal corrected dose function is d(r)P(r). We also define an exposure function E(r), specifying the energy density deposited in the resist, which is the latent image defining the pattern for subsequent processing steps.

P, d, and E are dimensionless ratios, normalized so that a uniform nominal dose, P=1 and d=1, results in a deposited energy density of E=1. A dose of dp=1 means that the dose in units of μC/cm² is equal to the nominal dose for a uniform exposure. Similarly, E=1 means that the deposited energy (Joules/cm³) is equal to the energy density deposited by this uniform dose.

A spread function φ(r′)=[1/(1+η)][κ(r′)+ηβ(r′)] describes the absorbed energy contribution from a pixel a distance r′ away from r. The short range function κ(r′) specifies the combined (indistinguishable) effects of beam blur and forward scattering² with a range comparable to the pixel size (typically 0.1 μm). The long range function β(r′) specifies exposure from backscattered electrons with a much larger range (typically 10 μm for 50 kV on silicon or glass). The constant η is the ratio of energy deposited by backscattered electrons to that deposited by forward scattered primary exposure. Spread functions do not have to be Gaussian, but they are separately normalized so that ∫κ(r′)dr′=1 and ∫β(r′)dr′=1. The factor [1/(1+η)] assures that the combined spread function φ(r′) is also normalized so that ∫φ(r′)dr′=1.

The deposited energy distribution E(r) is the convolution of the dose function with the spread function φ:

E(r)=∫d(r−r′)P(r−r′)φ(r′)dr′=[1/(1+η)][(dP*κ)_(r)+η(dP*β)_(r) ]J  (1)

Notation (*)_(r) will subsequently indicate a convolution evaluated at position r.

2.2. Isofocal Edge Exposure Criteria

We specify a process threshold E_(p) where, in the limit of a high resist contrast, all resist with deposited energy density exceeding this threshold is removed, and all resist with lower deposited energy density remains. FIG. 21-1 shows the normalized absorbed energy profile near an edge.

There is an optimum process threshold E_(p) for proximity corrected patterns. Exactly at the edge, the symmetric spread function φ is integrated over only half of an infinite plane, so when the dose is one, the deposited energy E=(P*φ)_(e)=½. If the process threshold E_(p) also equals ½, the edge of the resist pattern will be at its intended position, regardless of the width or shape of the spread function. This is also the point where the smallest lines and spaces can be printed. E_(p)=½ is called the isofocal exposure threshold.

The deposited energy gradient is dE/dr, which is proportional to the exposure contrast. The inverse of the deposited energy gradient dr/dE (nm/% dose error) measures the sensitivity of the edge position to dose errors. The gradient is maximum at E_(p)=½ so edge placement errors will be minimum. To retain the advantage of high contrast, edge shifts due to process bias must be compensated by edge shifts (bias) in the original pattern instead of shifts in the exposure level. Thus the goal of proximity correction is to expose all pattern edges isofocally with maximum exposure contrast.

FIG. 21-1. Edge shifts due to dose and process errors are minimized when the deposited energy gradient is maximum and the exposure is isofocal.

The integral equation for dose multiplier functions d that will expose all edges at the isofocal threshold is:

E _(e)=½=[1/(1+η)][(dP*κ)_(e)+(dP*β)_(e)]  (2)

(at all feature edges)

Except for normalization factors, this is the basic correction condition used in references,^(3,4) which apply an edge exposure criteria instead of trying to equalize the exposure inside features.

The convolutions in Equation 2 are called weighted pattern coverage functions. Run-time proximity correction calculates these convolutions from the original pattern data and uses them as variables in the correction algorithms.

2.3. Long-range Correction Algorithm

Integral Equation 2 is defined only at the edges. There are many possible solutions for the dose function dP, depending on the deposited energy density allowed inside features. Clever superposition or pattern functions can maximize exposure contrast as well as equalize edge exposure.

The simplest correction for long-range effects starts with the original pattern and applies the solution to Equation 2 at all points within the pattern. The solution takes the form of a recursion relationship^(2,4) that approaches the correct dose modulating function. We let the (n+1)th iteration be the (n)th iteration multiplied by a slowly varying function ƒ^((n+1)) (that approaches one as the solution converges), so that d^(n+1))=ƒ^((n+1))d^((n)). When there is no significant interaction between feature edges, (dP*κ)≈d/2 at the edge. Inserting dose ƒ^((n+1))d^((n)) into the isofocal criteria, we find that (1+η)=ƒd^((n))+2η(ƒd^((n))P*β. The slowly varying function ƒ is factored out of the convolution integral, making the approximation ƒd^((n))P*β≈ƒ(d^((n))P*β). Solving for ƒ, and multiplying it by d^((n)), we obtain the following recursion relationship:

d ^((n+1)) P={(1+η)d ^((n)) /[d ^((n))+2η(d ^((n)) P*β)]}p  (3)

(recursion relationship)

The first order approximation³ is found letting the zero order approximation d⁽⁰⁾=1.

d ⁽¹⁾ P={(1+η)/[1+2η(P*β)]}P  (4)

(first order approximation)

The first order solution is exact only when pattern Coverage is constant.^(2, 5) The iterative solution accounts for backscattering from the correction dose in a way that can easily be implemented in the electronics.

2.4. Short-range Correction Algorithm

Short-range effects within a feature may be corrected by modifying the dose of edge pixels. We superimpose a pattern P_(e) describing a one pixel wide border inside the edge of features and assign it a dose multiplier function d_(e) so that the new total dose is d₁P+d_(e)P_(e). Inserting this pattern into Equation 2 and ignoring backscatter from the edge dose, we find that d_(e)=[1−2(P*κ)_(e)di/2(P_(e)*κ)_(e). Note that coverages (P*κ)_(e) and (P_(e)*κ)_(e) are evaluated at the outside edges of edge pixels. The complete dose modulation algorithm is:

d(x)P(x)=d _(i) [P(x)+{[1−2(P*κ)_(e)]/2(P _(e)*κ)_(e) P _(e)(x)]  (5)

2.5. Simulations

FIG. 21-2 is a Mathematica simulation comparing an uncorrected exposure with an exposure using this dose modulated algorithm. The test pattern⁴ includes a 0.5 μm-wide isolated line, an isolated gap, and a line next to a wide exposed pad (with a high coverage gradient to emphasize second order errors). Gaussian spread functions with a 0.2 μm short-range interaction, a 4 μm backscatter interaction, and an η of 0.6 are chosen to make details visible in the illustration. The simulation is not intended to model a real system.

2a. Uncorrected d=P 2b. Corrected FIG. 21-2. Simulated Deposited Energy Distributions Using a Dose-modulated Edge-enhanced Algorithm

Critical dimension (CD) errors appear as variations in the width of the deposited energy profile where it crosses the development threshold. Correction almost equalizes E_(e) at the edges, making CDs nearly independent of local pattern density.

Other simulations showed that correction reduced backscatter effects by more than a factor of 10×. Short-range correction extended the minimum useable feature size (<5% CD error) to below the nominal edge resolution.

3. RUN-TIME CORRECTION ELECTRONICS

FIG. 21-3 shows a new pipeline processor that transforms rasterized pattern data for dose modulated proximity correction.

Input data from the rasterizer is an array of numbers P_(i) representing the percent of pattern covering each pixel on a discrete grid. This data is generated in the rasterizer electronics, stored in input buffer memory, and read to the proximity correction system in the required sequence. The correction processors calculate short-range coverage (P*κ)_(i) and long-range coverage (dP*β)_(i) for each pixel. These numbers serve as address for a lookup table² containing dose multipliers that have been precalculated and preloaded ahead of time. Corrected dose data d_(i)P_(i). is loaded into an output buffer and fed to the exposure system with the original pattern data P_(i) as beam-blanking and subpixel-detection instructions.

FIG. 21-3. Block Diagram of Correction Electronics 3.1. Short-range Processor

Short-range effects involve only a small number of neighboring pixels. The short-range processor is essentially a lookup table storing a precalculated coverage value for each unique combination of near neighbor pixel data. Local patterns are fairly simple, and translations, rotations, and reflections do not influence coverage, so the number of unique near neighbor configurations is not too large. Edge, line, and corner configurations may be recognized, and a reasonably sized lookup table may be preprogrammed for each case.

FIG. 21-4 shows how a corner pixel may be recognized from its surrounding context and intentionally overdosed to shift its boundary outwards to minimize corner rounding and tapering at the end of narrow lines. The location of the sampling point for (P*κ) and (P_(e)*κ)_(e). used to precalculate the dose multiplier that determines where the edge will print.

FIG. 21-4. Short-range Corrections Using Near Neighbor Pixel Data 3.2. Long-range Processor

Long-range backscattering involves a much larger number of neighboring pixels, making lookup table techniques impractical. FIG. 21-5 shows a segment of a scan stripe surrounding a pixel whose coverage is to be evaluated. The range of the backscatter spread function β is also shown.

To calculate long-range coverages (dP*β)_(i), the pattern is subdivided into larger cells with index k. Each cell contains many pixels. The long-range spread function is approximated by a matrix of precalculated numbers β_(k·k0) stored in the processor.

Cell size is chosen so that this stepwise spread function is a reasonable approximation of the actual spread function. Cells are much smaller than the backscatter range (in contrast to the pattern area density technique³, but they contain many pixels.

The electronics calculates coverage in two steps. Using a dedicated adder circuit operating at the pixel exposure frequency, pixel data for each cell is summed to obtain the average pattern density P_(k)=dk⁽⁰⁾=(1/I²)Σ_(±½)P_(ik.) This data forms a coarse coverage map stored in a much smaller buffer memory. This gray level map is the zero order representation of the pattern for iterative calculation of a backscatter correction dose multiplier d_(k).

FIG. 21-5. Calculating Long-range Coverage Using Cells

The coarse coverage map is then convolved with the spread function in the calculation is (d*β)_(k0)≈(1/K²)Σ_(±K/2)d_(k)β_(k·k0). The output is a map of (d^((n−1))*β)_(k0), which is used as lookup data to find the appropriate dose multiplier d_(k) ^((n)). If necessary, this data may then be used as new coarse coverage data for the next iteration d_(k) ^((n+1)). When the calculation is complete, correction d_(k) is applied to all pixels within the cell.

The following calculation suggests that the convolutions may be performed in run time. A 1.0 mm square scan field segment might contain a 10,240×10,240 array of 0.1 μm square pixels. At 160 MHz pixel rate (6.25 nsec per pixel exposure), this field is scanned in 0.65 sec. During this time, the long-range processor must perform one or more iterative convolutions on the next data segment.

We divide the field into a 1024×1024 array of 1 μm square cells, each containing 100 pixels. These cells are much smaller than the scattering range of higher voltage electrons, so there should be little quantization error associated with the cells.

TV bandwidth video processors using fast Fourier transforms perform the convolution with a minimum number of operations. A modem digital signal processor (DSP) can transform, multiply by the kernel, and reverse transform (convolve) this 1024×1024 field in less than 0.2 sec.⁶ This is less than ⅓ of the available time, so there should be enough time for three iterations. Thus we believe that computation rate requirements can be met with available electronics.

4. SUMMARY

We have described a run-time proximity correction method for raster scan lithography systems. Starting with uncorrected pattern data in a rasterized graybeam format, correction is simultaneous with exposure. Correction does not reduce system throughput, regardless of pattern complexity. A suitable dose modulation algorithm based on equalization of edge exposure is derived. An electronic system using DSP convolvers and lookup table memory to execute both short- and long-range correction is presented. We believe that run-time correction is possible in a raster scan system.

REFERENCES

1. L. Muray, F. Abboud, F. Raymond, and C. N. Berglund. J. Vac. Sci. Technol. B 12, p. 3465 (1994).

2. T. R. Groves, J. Vac. Sci. Technol., B 11, p. 2746 (1993).

3. F. Murai, H. Yoda, S. Okazaki, N. Saito, and Y. Sakitani, J. Vac. Sci. Technol., B 10, p. 3072 (1992).

4. G. Watson, L. Fetter, and J. A. Liddle, J. Vac.Sci. Technol., B 15, p. 2308 (1997).

5. T. Dasuga, M. Konishi. T. Oda, and S. Moriya, J. Vac. Sci. Technol., B 14, p. 3870 (1996).

6. J. Sgro, DSP and Multimedia Technology, May/June (1996). 

What is claimed is:
 1. A flash converter that determines shape data that specifies a flash field among pixels, said pixels being represented as gray level values that represent a proportion of each said pixel that overlaps with a pattern, comprising: a reformatter which constructs a matrix of a quadrant and surrounding pixels of said flash field, wherein said reformatter modifies said matrix so that N intermediate shapes correspond to an exposed region of said quadrant are provided; and a shape code determinator coupled to receive said modified matrix and which determines an intermediate shape data that specifies said quadrant, wherein said shape code determinator performs a reverse modification on said intermediate shape data and outputs shape data that specifies said flash field.
 2. The flash converter of claim 1 wherein N is
 3. 3. The flash converter of claim 1 further comprising: a first lookup table coupled to said shape code determinator, wherein said shape code determinator further determines an intermediate shape code and intermediate coordinates associated with said intermediate shape data from said first lookup table.
 4. The flash converter of claim 3 wherein said first lookup table stores entries of said intermediate shape code and intermediate coordinates associated with gray level values of pixels among said modified matrix.
 5. The flash converter of claim 4 wherein said intermediate shape code and intermediate coordinate entries correspond to a minimum gray level value error for said quadrant and selected pixels among said modified matrix.
 6. The flash converter of claim 1 further comprising a second lookup table coupled to said shape code determination circuit wherein said shape code determination circuit further determines a shape code and coordinates associated with said shape data from said second lookup table.
 7. The flash converter of claim 6 wherein said second lookup table stores entries of said shape code and coordinates for each combination of reverse modification operations on said intermediate shape data.
 8. The flash converter of claim 1 wherein said flash converter constructs each said matrix in a raster scan.
 9. A method for determining shape data that specifies a flash field among pixels, said pixels being represented as gray level values that represent a proportion of said pixel that overlaps with a pattern, comprising the acts of: constructing a matrix of a quadrant and surrounding pixels; modifying said matrix so that N intermediate shapes corresponding to an exposed region of said quadrant are provided; determining an intermediate shape data that specifies said quadrant; and performing a reverse modification on said intermediate shape data to determine said shape data that specifies said flash field.
 10. The method of claim 9 wherein N is
 3. 11. The method of claim 9 wherein said determining an intermediate shape data further includes the act of looking up an intermediate shape code and intermediate coordinates.
 12. The method of claim 11 wherein said looking up comprises the act of looking up of said intermediate shape code and intermediate coordinates associated with gray level values of pixels of said modified matrix.
 13. The method of claim 12 wherein said intermediate shape code and intermediate coordinates correspond to a minimum gray level value error for said quadrant and selected pixels of said modified matrix.
 14. The method of claim 9 wherein performing a reverse modification on said intermediate shape data to determine said shape data further includes the act of looking up a shape code and coordinates.
 15. The method of claim 14 wherein said looking up comprises the act of looking up said shape code and coordinates for a combination of reverse modification operations on said intermediate shape data.
 16. The method of claim 9 wherein said constructing further comprises the act of constructing a matrix of a quadrant and surrounding pixels in a raster scan. 